A Data Mining Approach for the Prediction of Hepatitis C Virus protease Cleavage Sites
نویسنده
چکیده
Summary: Several papers have been published about the prediction of hepatitis C virus (HCV) polyprotein cleavage sites, using symbolic and non-symbolic machine learning techniques. The published papers achieved different Levels of prediction accuracy. the achieved results depends on the used technique and the availability of adequate and accurate HCV polyprotein sequences with known cleavage sites. We tried here to achieve more accurate prediction results, and more Informative knowledge about the HCV protein cleavage sites using Decision tree algorithm. There are several factors that can affect the overall prediction accuracy. One of the most important factors is the availably of acceptable and accurate HCV polyproteins sequences with known cleavage sites. We collected latest accurate data sets to build the prediction model. Also we collected another dataset for the model testing. Motivation: Hepatitis C virus is a global health problem affecting a significant portion of the world’s population. The World Health Organization estimated that in1999; 170 million hepatitis C virus (HCV) carriers were present worldwide, with 3 to 4 million new cases per year. Several approaches have been performed to analyze HCV life cycle to find out the important factors of the viral replication process. HCV polyprotein processing by the viral protease has a vital role in the virus replication. The prediction of HCV protease cleavage sites can help the biologists in the design of suitable viral inhibitors. Results: The ease to use and to understand of the decision tree enabled us to create simple prediction model. We used here the latest accurate viral datasets. Decision tree achieved here acceptable prediction accuracy results. Also it generated informative knowledge about the cleavage process itself. These results can help the researchers in the development of effective viral inhibitors. Using decision tree to predict HCV protein cleavage sites achieved high prediction accuracy. Keywords-component; HCV polyprotein; decision tree; protease; decamers
منابع مشابه
Reduced Bio-basis Function Neural Networks for Protease Cleavage Site Prediction
This paper presents a new neural learning algorithm for protease cleavage site prediction. The basic idea is to replace the radial basis function used in radial basis function neural networks by a so-called bio-basis function using amino acid similarity matrices. Mutual information is used to select bio-bases and a corresponding selection algorithm is developed. The algorithm has been applied t...
متن کاملIn vivo selection of protease cleavage sites by using chimeric Sindbis virus libraries.
Identifying protease cleavage sites contributes to our understanding of their specificity and biochemical properties and can help in designing specific inhibitors. One route to this end is the generation and screening of random libraries of cleavage sites. Both synthetic and phage-displayed libraries have been extensively used in vitro. We describe a novel system based on recombinant Sindbis vi...
متن کاملActivity of purified hepatitis C virus protease NS3 on peptide substrates.
The protease domain of the hepatitis C virus (HCV) protein NS3 was expressed in Escherichia coli, purified to homogeneity, and shown to be active on peptides derived from the sequence of the NS4A-NS4B junction. Experiments were carried out to optimize protease activity. Buffer requirements included the presence of detergent, glycerol, and dithiothreitol, pH between 7.5 and 8.5, and low ionic st...
متن کاملIdentification of Aptamer-Binding Sites in Hepatitis C Virus Envelope Glycoprotein E2
Hepatitis C Virus (HCV) encodes two envelope glycoproteins, E1 and E2. Our previous work selected a specific aptamer ZE2, which could bind to E2 with high affinity, with a great potential for developing new molecular probes as an early diagnostic reagents or therapeutic drugs targeting HCV. In this study, the binding sites between E2 and aptamer ZE2 were further explored. E2 was truncated to 15...
متن کاملUsing Boehmite Nanoparticles as an Undercoat, and Riboflavin as a Redox Probe for Immunosensor Designing: Ultrasensitive Detection of Hepatitis C Virus Core Antigen
In this study a label-free electrochemical Immunosensor for ultrasensitive detection of Hepatitis C virus core antigen in serum samples was fabricated by using a simple approach. In this method a low-cost and sensitive immunosensor was fabricated based on a boehmite nanoparticles (BNPs) modified glassy carbon. The BNPs provide a specific platform with increased surface area which is capable of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012